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Asymptotic  Theory  for  Weighted  Least  Squares  Estimators  in 
Aalen’s  Additive  Risk  Model 


Ian  W.  McKeague 


Abstract.  Let  h(t|Z,-]  be  the  conditional  hacard  function  for  the  survival  time 
of  an  individual  t  given  the  p-dimensional  cov^ate.^ process  Zi(t).  A^-study. 
inference  for  Aalen's  additive  risk  model  h(t|Zt)  =  it{{t)'ai(t),  where  ci  is  a  p- 
vector  of  unknown  hasard  functions.  The  theory  of  counting  processes  is  used 
to  obtain  weak  convergence  results  for  weighted  least  squares  estimators  of  the 
hasard  functions  and  the  cumulative  hasard  functions  based  on  continuous  data. 


Results  for  weighted  least  squares  estimators  based  on  grouped  data  are  also 


described. 
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1.  Introduction. 


The  proportional  haaards  regression  model  of  Cox  (1972)  for  the  analysis  of 
censored  survival  data  has  had  considerable  influence  on  the  theory  and  practice 
of  biostatistics.  In  recent  years  this  has  led  to  the  study  of  a  wide  variety  of 
hazard  function  based  regression  models  which  generalize  Cox’s  model  in  some 
way.  For  a  comprehensive  list  of  references  to  such  work  see  the  paper  of  Ritov 
and  Wellner  (1987,  in  these  proceedings). 

Let  h(t\Zi)  denote  the  conditional  hazard  function  for  the  survival  time  T, 
of  an  individual  t  given  the  covariate  process  Zi{t)  =  {Zii{t),. ..,  ZiAt)y,t>o. 
The  most  general  model  for  h{t\Zi)  that  seems  to  be  amenable  to  statistical 
analysis  is 

h(t|Z,)=a(t,Z,(t)),  (1.1) 


where  a  is  a  completely  general  (unknown)  function  of  time  and  the  state  of  the 
covariate  process.  Inference  for  this  model  has  been  studied  by  Reran  (1981) 
and  Dabrowska  (1987a,  1987b)  in  the  case  of  time-independent  covariate,  and 
by  McKeague  and  Utikal  (1987)  in  the  case  of  time-dependent  covariate  process. 
Although  this  model  is  attractive  from  a  theoretical  standpoint,  in  that  it  can 
encompass  goodness  of  fit  tests  for  any  particular  model,  its  large  sample  size 
requirements  make  it  difficult  to  apply  in  practice. 
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Aalan’i  Additive  Risk  Model 


If  we  assume  that  a(t,  0)  =  0  and  ignore  all  terms  higher  than  first  order 
in  Taylor’s  expansion  of  a(t,z)  about  z  =  0,  then  the  model  (1.1)  reduces  to 
Aalen’s  (1980)  additive  risk  model: 


hm)  =  '£a,it)Zi,it). 


(1.2) 


y=i 
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where  ai, . . .,  ap  are  unknown  functions  of  time.  Aalen’s  model,  which  we  study 
in  this  paper,  is  capable  of  providing  information  concerning  the  temporal  influ¬ 
ence  of  each  covariate  not  possible  to  obtain  using  Cox’s  model,  yet  it  does  not 
require  the  extremely  large  sample  size  needed  for  fitting  the  general  model  (1.1). 
McKeague  (1986)  studied  estimation  for  a  =  (ai,...ap)'  using  the  method  of 
sieves.  Recently,  Huffer  and  McKeague  (1987)  proposed  various  weighted  least 
squares  estimators  for  a  and  its  integrated  counterpart  A(-)  =  C((s)ds.  The 
purpose  of  the  present  paper  is  to  establish  weak  convergence  results  for  such 
weighted  least  squares  estimators  in  the  case  of  continuous  data.  Weak  conver¬ 
gence  results  for  grouped  data  based  weighted  least  squares  estimators  are  given 
in  McKeague  (1987).  In  the  grouped  data  case  only  the  total  number  of  uncen¬ 
sored  survival  times  falling  in  successive  time  intervals  and  the  corresponding 
total  times  at  risk,  for  all  levels  of  the  covariates,  are  assumed  to  be  available, 
whereas  in  the  continuous  data  case  treated  here  the  exact  values  of  the  un¬ 
censored  survival  times  T\,...,Tn  are  assumed  to  be  known.  Not  surprisingly, 
better  results  can  be  obtained  in  the  continuous  data  case. 

In  Section  2  we  describe  the  counting  process  formulation  of  Aalen’s  model. 
The  weighted  least  squares  estimators  based  on  continuous  data  are  defined  in 
Section  3.1  and  compared  with  their  grouped  data  analogues  in  Section  3.2. 
Proofs  of  the  main  weak  convergence  results,  stated  in  Section  3.1,  are  given  in 
Section  4. 
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2.  Aalen’s  model  in  the  cotmting  process  framework. 

Suppose  that  the  observable  portion  of  the  ith  individual’s  lifetime  Ti  is 
given  by  TJ  =  min(rj,Ci),  where  Cj  is  conditionally  independent  of  Ti  given  the 
covariate  process  Zi-  Also  suppose  that  T,  and  Q  are  absolutely  continuous.  The 
observations  consist  of  i.i.d.  triples  [Ti,  Si,  Kj),  t  =  1, . . .,  n,  where  Si  =  I[Ti  <  Q) 
and  Yi  is  the  process  Yi(t)  =  Zi[t)I(fi  >  t).  Now  let  Ni(t)  denote  the  indicator 
of  an  uncensored  failure  for  individual  i  prior  to  time  t: 

Ni[t)  =  I(fi  <  t,Si  =  1), 

and  suppose  that  each  covariate  process  Zi  is  left-continuous  with  right-limits. 
Under  Aalen’s  model  (1.2)  the  counting  process  Ni  has  intensity 

=  (2.1) 

}=i 

with  respect  to  the  right-continuous  filtration  Tt  =  <^(^«(a  ),Yi^{s-),0  <  s  < 
t,i  >  l,y  =  l,...,p).  Also,  no  two  of  the  counting  processes  Ni,...,Nn  jump 
simultaneously. 


.‘‘1 
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More  generally,  let  7V(t)  =  e  [0,1]  be  a  mnltivariate 

counting  process  with  respect  to  a  right-continuous  filtration  (^),  i.e.  N  is 
adapted  to  the  filtration  and  has  components  Ni  which  are  right-continuous  step 
functions,  sero  at  time  zero,  with  jumps  of  size  +1  such  that  no  two  components 
jump  simultaneously.  Let  A  be  the  compensator  of  N,  so  that  N  =  A.  +  M, 
where  M  =  (M\,. . .,  A/„)'  and  Mi,. . .,  Mn  are  local  martingales.  Suppose  that 
A  is  absolutely  continuous  (a.s.):  A(t)  =  f*  A(s)ds,  where  A  =  (Ax,...,A„)' 
and  Ai,...,A„  are  nonegative  predictable  processes  (intensity  processes).  The 
counting  process  version  of  Aalen’s  model  is  given  by 


A(t)  =  y(t)a(t). 


where  a  =  (oii, . . .,  dtp)'  is  a  vector  of  unknown  nonrandom  integrable  functions 
and  y(t)  =  (Yij(t))  is  an  n  x  p  matrix  of  covariate  processes  assumed  to  be  pre¬ 
dictable  and  locally  bounded.  For  fixed  to,0  <  to  <  1|  denote  Alt)  =  a(s)d3, 

where  to  <  t  <  1.  The  statistical  problem  is  to  estimate  a  and  A. 


3.  The  weighted  least  squares  estimators. 


3.1.  The  continuous  data  case. 


Suppose  that  the  entire  sample  paths  of  the  process  N  and  V  are  observed 
over  [0,  ij.  Aalen  (1980)  proposed  estimators  A  of  A  of  the  form 


A(t)=  fY-(s)dN{s), 
Jto 


where  Y  (a)  is  a  predictable  generalized  inverse  of  y(s).  In  the  case  p  =  1  with 


(>'"W)i^  =  (Eni(3))  .  »  = 

^k  =  l  ' 


=  l,...,n 


(where  1/0  =  0)  A  is  the  Nelson-Aalen  estimator  for  which  a  general  asymp¬ 
totic  theory  was  derived  by  Aalen  (1978).  For  p  >  1  Aalen  suggested  using 
Y~(s)  =  (y'(a)y’(3))“^y'(s),  where  here  and  in  the  sequel,  for  any  square  ma¬ 
trix  (or  scalar)  D,  D~^  denotes  the  inverse  of  i?  if  D  is  invertible,  the  zero  matrix 
otherwise.  Aalen  observed  that  this  choice  of  Y~  can  be  motivated  by  a  formal 
least  squares  principle  and  that  the  resulting  estimator 


A(t)  =  r(y'(s)y(s))-‘y'(s)div(s), 
Jta 


referred  to  <is  Aalen’s  least  squares  estimator,  probably  gives  reasonable  but  not 
optimal  estimates  of  A.  Recently  Huffer  and  McKeague  (1987)  suggested  using 
the  following  generalized  inverse  of  y(s); 


y-(s)  =  (y'(s)vV(s)y(s))-ir(s)v«^(s), 
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where  IV  (t)  is  the  nxn  diagonal  matrix  with  tth  diagonal  entry  VVi(t)  =  (Ai(t))  ^ 
and 

=  (3.4) 

J=1 

where  dy  is  a  predictable  estimator  of  ay.  The  estimator  dy  is  taken  to  be  the 
j'th  component  of  the  smoothed  least  squares  estimator 

=  K(^^yA(s),  (3.5) 

where  if  is  a  left-continuous  bounded  kernel  function  having  integral  1,  support 
[0,  l]  and  6„  >  0  is  a  bandwidth  parameter.  The  choice  of  generalized  inverse 
(3.3)  defines  what  we  call  the  weighted  least  squares  estimator 

i(0  =  f\Y'(s)W{s)Y{s))-^r(s)W(s)dN{s).  (3.6) 

Jto 

Observe  that  in  the  case  of  a  single  covariate  the  weighted  least  squares  estimator 
coincides  with  the  Nelson-Aalen  estimator  and  no  estimate  of  the  weights  ^i{t) 
is  needed.  In  order  to  obtain  an  estimator  of  a  a  itself  we  can  smooth  A  to 
obtain  a  smoothed  weighted  least  squares  estimator 

a{t)  =  ^  f  k(^-^)dA{s),  to<t<l,  (3.7) 

where  ^  is  a  bounded  kernel  function  having  integral  1,  support  [—1, 1]  and 
&n  >  0  is  a  bandwith  parameter.  In  the  case  of  a  single  covariate  the  smoothed 
weighted  least  squares  estimator  coincides  with  the  kernel  estimator  introduced 
by  Ramlau-Hansen  (1983). 

Let  D[to)  1]'’  denote  the  product  of  p  copies  of  the  Skorohod  space  Z>[fo.  l) 
and  endow  it  with  the  Skorohod  product  topology.  Also  denote 

fl 

i  =  l 

t=l 

Our  first  result,  which  gives  the  asymptotic  distribution  of  Aalen’s  least 
squares  estimator  (3.2),  employs  the  following  conditions: 

(Al)  (Asymptotic  stability).  For  ],k,l=  1, . . .,  p  there  exist  continuous  func¬ 
tions  Ljk  ^d  Rj),i  defined  on  [0,  l]  such  that 

sup  |L,fc(t)  -  Lyfc(t)|40, 

«do.i| 

sup  |Ryfc«(f)  -  Ryfc«(t)|.^0. 
tdo.l] 


l.j-j j 
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(A2)  (Lindeberg  condition).  For  each  j  =  1, . . .,  p 


n~^  sup  |yiy(t)|-^0. 

1  =  1....  ,n 

tdo.il 

(A3)  (Asymptotic  nondegeneracy  condition).  The  pxp  matrix  L(t)  =  {L,k(t)) 
in  (Al)  is  nonsingular  for  all  t  e  [0,  ij. 

Theorem  3.1.  Let  to  =  0-  Under  conditions  (A1)-(A3) 
y/n(A  —  A)  m  in  i)[0,  l]** 

where  m  is  a  p-variate  continuous  Gaussian  martingale  with  mean  sero  and 
covariance  function 


p  p  p 

Cov(my(t),m*(t))  =  EEE/ „(s)(L  *(s))y„(L  ‘(s))fc„a„(s)ds. 

u=l  »  =  !  «i=l 

In  order  to  establish  our  weatk  convergence  results  for  the  weighted  least  squares 
estimator  (3.6)  and  the  smoothed  weighted  least  squares  estimator  (3.7)  we  need 
the  following  additional  conditions: 

(Bl)  (Asymptotic  stability).  For  j,k  =  l,...,p  there  exist  continuous  func¬ 
tions  Vjk  defined  on  [0,  l]  such  that 

sup  |Vy*(t)  -  yyfc(t)|-^0. 

(B2)  (Bounded  covariates).  The  processes  Yii,i  >  i,j  =  1, ...,p  are  uni¬ 
formly  bounded. 

(B3)  (Asymptotic  nondegeneracy  condition).  The  pxp  matrix  V (f)  =  (V'yfc(f)) 
in  (Bl)  is  nonsingular  for  all  t  c  [0,  ij. 

(B4)  (Intensity  regularity  condition).  There  exists  5  >  0  such  that  if  Yij(t)  ^ 
0  for  some  y  =  1, . .  .,p  then  Aj(t)  >  S. 

(B5)  The  functions  cti, . . .,  ctp  are  continuous. 

Theorem  3.2.  Suppose  that  conditions  (Al),  (A3),  (Bl)  -  (B5)  hold,  b„  — • 
0,nbl  — •  oo  and  the  kernel  function  K  has  bounded  variation.  Let  0  <  to  <  1. 
Then 

\/n(A  —  A)  ^  m'  in  D[to,  Ij** 

where  m'  is  a  p-variate  continuous  Gaussian  mswtingale  with  mean  sero  and 
covariance  function 


Coy{m'j(t),m'^(t))  =  f  (V  ‘(a))yfcds. 
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Theorem  3.3.  Suppose  that  conditions  (Al),  (A3),  (Bl)  -  (B4)  hold,  bn  — * 
0,nbl  — ►  oo,nbn  —*  oo,nbn  —*  0  and  ai,...,ap  have  bounded  derivatives  in  a 
neighbourhood  oft,  where  0  <  to  <  t  <  1.  Then 

(n6„)i(5(t)  -  a(t)) 

converges  in  distribution  to  a  p-dimensional  normal  distribution  with  mean  sero 
and  covariance  matrix  V~^(t)  K^(s)ds. 

In  order  to  apply  the  weak  convergence  results,  to  obtain  hypothesis  tests 
and  confidence  bands  for  instance,  it  is  necessary  to  estimate  the  covariances  of 
the  limiting  Gaussian  distributions.  Using  Lenglart’s  inequality  it  can  be  shown 
that 


[\y-{s))j,(Y-(s))M(s) 
<=1  ■'*0 


is  a  uniformly  consistent  estimator  of  Cov(m'  (t),  m^(t))  over  [to,  ij,  where  Y  (a) 
is  given  by  (3.3).  Lemma  4.3(c)  shows  that  ^~^(f)  is  a  uniformly  consistent 
estimator  of  the  matrix  function  V~^(t)  over  [to,  ij. 

3.2.  The  grouped  data  case. 

Suppose  that  the  data  are  grouped  into  d„  time  intervals  jj"*,  r  =  1, . . .,  d„ 
which  partition  [0, "  1  and  depend  on  the  sample  sue  n.  It  is  natural  to  assume 
that  the  hazard  function  Oi , . . .,  oip  are  constant  over  each  interval,  giving  rise  to 
a  sequence  of  Aalen  models  indexed  by  n.  The  hazard  functions  Oj"*, . .  ..Op"* 
in  the  nth  model  are  constrained  to  be  the  piecewise  constant  approximations 
to  fixed  underlying  hazard  functions  Oi, . . .,  otp-. 


^  aj(s)d3  for  t  € 


where  tr  =  is  the  length  of 

Assume  that  the  covariate  processes  Zi  are  time-independent  and  the  total 
time  at  risk  and  number  of  uncensored  failures  are  known  for  each  interval  and 
covariate  level.  Least  squares  and  weighted  least  squares  estimators  of  a*’''  = 
(aj"*, . .  .ap"*)'  based  on  such  data  are  given  by 

=(/,.,  Y'(s)W(s}Y{s)dsy'  n*)W(s)dJV(s),  t  e 

W  If  w  If 

respectively,  where  V$'^(f)  is  the  n  x  n  diagonal  matrix  having  ith  diagonal  entry 
IV, (f)  =  (Aj(f))“‘  and  Ai(f)  is  an  estimate  of  the  intensity 

^!"’(0  =  i;a'"'(0y.‘'‘’(0- 


B* 
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It  is  reasonable  to  expect  that  if  A,(t)  is  chosen  appropriately  and  the  mesh  of 
the  partition  tends  to  zero  at  a  suitable  rate  as  n  — >  oo,  then  the 

estimate  A(  )  =  f.  afslds  satisfies  a  functional  central  limit  theorem  analogous 
to  Theorem  3.2  \ 

The  estimator  is  taken  to  be 


where  a^*  =  (c^i  i .  •  *,  cip*)*  is  piecewise  constant  approximation  to  the 
smoothed  least  squares  estimator 


where  is  a  left-continuous  kernel  function  having  integral  1,  support  (0,  l] 
and  6„  >  0  is  a  bandwidth  parameter.  Suppose  that  the  intervals  are  chosen 
so  that  to,0  <  to  <  1,  is  always  a  boundary  point  of  one  of  them.  Define 
A{  )  =  /g  Q*'*)(a)d3,  the  piecewise  linear  approximation  to  A. 

We  require  the  following  conditions: 

(Cl)  (Asymptotic  stability).  For  j,k,t  =  there  exist  functions  ly*, 

Rjkt  Vjk  defined  on  [0, 1]  such  that 

sup  |Iyfc(t)  -  Ljk{t)  \  =  op('(/min{£i,...,fd)) 


sup  |fl;fc(t)-/i;fc(t)|^0 

8«P  l^yfc(0  ~  ^yfc(*)l  =  Op(\/min(£i,..., fd)). 

“|o.i| 

(C2)  The  functions  ay,  Ljk,Vjk,j,  fc  =  1, . . ., p  are  Lipschitz. 

(C3)  The  bandwidth  parameter  6„  and  the  interval  lengths  satisfy 

6^nmin(fi,..  .,fd)  -•  oo 
-  0 

V'mm(^i, ..  .,fd) 
bn 

— ^  OO. 

min(fi,...,fd) 

Proposition  3.1.  (Grouped  d^ti  case).  Suppose  that  conditions  (A3),  (B2)- 
(B4),  (C1)-(C3)  bold  and  the  kernel  function  K  has  bounded  variation.  Let 
0  <  to  <  1.  Then  \/n(A  -  A)  converges  weakly  in  D[to,l]'’  to  the  p-variate 
Gaussian  martingale  m'  of  Theorem  3.2. 

We  refer  to  McKeague  (1987)  for  a  proof  of  this  proposition.  Note  that  the 
estimators  d,  d,  A  require  the  total  time  at  risk  in  each  interval  at  each  covariate 


■r.v^v 


8  Aalen’t  Additive  Risk  Model 

level  to  be  available.  When  only  interval  .ount  data  are  available,  survival  ana¬ 
lytic  techniques  do  not  apply.  In  that  case,  contingency  table  techniques  provide 
an  alternative  approach,  see  Koch,  Johnson  and  Tolley  (1972),  Bishop,  Fienberg 
and  Holland  (1975),  Gilula  (1986)  and  Kiefer  (1987). 

A  grouped  data  based  estimator  of  the  covariance  matrix  of  the  limiting  Gaussian 
martingale  m'  is  given  in  McKeague  (1987).  Although  it  is  possible  to  obtain  a 
pointwise  weak  convergence  result  for  the  least  squares  estimator  d:(t),  we  have 
not  been  able  to  do  so  for  the  weighted  least  squares  estimator  a{t),  or  any 
smoothed  version  of  it,  so  a  grouped  data  analogue  of  Theorem  3.3  is  not  yet 
available. 

4.  Proofs  of  Theorems  8. 1-3.3. 

The  following  lemma,  analogous  to  Lemma  4.2  of  McKeague  (1987),  is  stated 
without  proof. 

Lemma  4.1.  Under  conditions  (Al)  and  (A3) 

(a)  P(L(t)  is  invertible  for  all  t  e  [0,  l])  — *  1, 

(b)  sup  ||L“^(t)  —  L“^(t)||-^0,  where  ||  •  {|  denotes  operator  norm, 
ulo.ij 

Proof  of  Theorem  3.1.  JVom  (3.2)  and  (2.2)  we  can  write 

V^(A(t)  -  A(0)  =  ^('‘'(O  -  r  (4.1) 

Jo 

where 

A-<"'(f)  =  ^  r£-‘(s)y'(s)dM(s), 

V"  Jo 

J{t)  =  I(L{t)  is  not  invertible). 

By  Lemma  4.1  and  conditions  (Al),  (A3),  the  last  term  on  the  r.h.s.  of  (4.1) 
converges  uniformly  to  sero  in  probability.  It  remains  to  show  that  X*"*  m 
in  £>[0,  l]'’.  By  Lemma  4.1  there  exists  a  constant  C  >  0  such  that 

P{L-^{t)  =  U{t)  for  all  t  €  [0,  Ij)  -  1  (4.2) 

as  n  — *  oo,  where  U(t)  =  (C^yfc(t))  is  the  p  X  p  matrix  with  entries 


^3k(t)  =  \ 

(  C  otherwise. 


(O)yfcl  <  C 


Define  the  process 


and  note  that 


A'">(t)  =  -^  /“t/(s)r(s)dM(s) 

V"  Jo 


sup  ||X<"'(t)  -  A-<”'(OII'^0- 


Ian  W.  McKea^e 


The  jth  component  of  can  be  written 


The  local  martingales  M\,. . .,  Mn  are  local  square  integrable  martingales  on  the 
time  interval  [0,  l]  and  their  predictable  quadratic  variation  processes  are  given 
by 

<  Mi,  Mi  >  (t)  =  I  Xi{s)d3  and  <  Mi,Mj  >  (t)  =  0,  i  ^  j. 

Jo 

The  process  /T,!”*  is  locally  bounded  and  predictable  so  that  is  a  local 

square  integrable  martingale.  The  predictable  quadratic  variation  processes  of 
Xp"*  Jire  given  by 

<  >  (t)  =  r^Hl;\3)Hl:\3)Xi(s)ds 

•'0  <=1 

~  XI  ^(■s))fciii.^tiu)(s)au(s)ds 

U=lt/  =  1«>=1"'° 


+  op(l) 


Cov(m,(t),mfc(t))  as  n  — *  00, 


(by  (4.2)) 


by  the  asymptotic  stability  condition  (Al)  and  Lemma  4.1.  The  Lindeberg 
condition 

/'E^i;’(‘)"A,(f)/(|/f^’*'(t)l  >  e)dt^0 

X=1 

CIS  n  —X  oo,  for  each  «  >  0,  y  =  1, . . .,  p,  is  a  consequence  of 


sup  |#';>(t)|<Cpni  sup  |y.,(t)| 

»=1 . n  t  =  n 

te(0,l|  «<;|0,l| 


(by  condition  (A2)). 


Thus,  by  Rebolledo’s  central  limit  theorem  for  local  square  integrable  martingales 
in  the  form  given  by  Andersen  and  Gill  (1982,  Theorem  1.2),  it  follows  that 
m  in  £)[0,  IJp.  Combining  this  with  (4.3)  we  obtain  that  m  in 

Z?[0,  ll^,  which  completes  the  proof. 


V.V • 
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Lemma  4.2.  Suppose  that  conditions  (A1)-(A3),  (B5)  hold,  the  kernel  function 
K  has  bounded  variation,  6„  — ♦  0  and  nb^  — •  oo.  Let  0  <  to  <  1.  Then 

p 

sup  |qj  (t)  —  ay(<)|— *0. 

«e|«o,l| 

Proof.  Use  integration  by  parts  and  Theorem  3.1,  cf.  the  proof  of  Theorem 
2.2  of  McKeague  (1987). 

n 

Let  V[t)  denote  the  pxp  matrix  with  entries  Ujfc(t)  =  Y„(t)Y,k{t)W,(t). 

•=i 

Lemma  4.3.  Suppose  that  the  conditions  of  Theorem  3.2  hold.  Then 

(a)  P{V(t)  is  invertible  for  all  t  e  (fo,  Ij)  — ►  1, 

(b)  sup  |iy,(t)-w4t)|Ao, 

,n 

telto.l] 

(c)  sup  ||U-i(t)  -  V-i(t)||4o. 

t«lto,l| 

Proof.  U«!  conditions  (Bl)-(B5),  especially  the  bounded  covariates  con¬ 
dition  (B2),  and  Lemma  4.2.  The  proof  is  similar  to  the  proof  of  Lemma  4.3  of 
McKeague  (1987). 

Proof  of  Theorem  3.2.  We  can  write 

MA{t)  -  A(t))  =  A-("'(t)  -  r  J(5)dA(s),  (4.4) 

Jto 


where 


<"'w  =  ;^g;;>(s). 

*=i 

J(s)  =  I(y{3)  is  not  invertible). 

By  Lemma  4.3(a)  and  conditions  (Bl),  (B3),  the  last  term  on  the  r.h.s.  of  (4.4) 
converges  uniformly  to  zero  in  probability.  It  remains  to  show  that  X*"’  m'  in 
D\0, 1]P.  By  Lemma  4.3  and  conditions  (B2),  (B4),  (B5),  there  exists  a  constant 
C  >  0  such  that 


P(G';>  (s)  =  G';’  (s)  for  all .  =  1, . . .,  n,  s  e  [to,  l|)  -  1 


(4.5) 


k'here 


Ian  W.  McKeague 


g!;’(3)  = 


otherwise. 


Define  the  processes 


=  (4.7) 

Ar’(‘)  =  E  rA';'(^)<^A^.(3).  (4.8) 

Since  H,*"*  is  predictable  and  bounded,  .Yj"*  is  a  local  square  integrable  mai-tin- 
gale.  Also 

sup  ||X<'*'(£)-je<"'(t)||^0  as  n  — *  oo,  (4.9) 

*d‘o.i| 

where  X*”*  =  The  predictable  quadratic  variation  processes 

of  Xj"*, . . JYp”*  are  given  by 

<  >  (£)  =  rf]^<;>(s).tf<;>(s)A,(s)d3 

•'‘o  i=l 

=  E  E  f(^'H^)hu(V-'(^))kM^)ds  +  op(l), 

U=1  l)=l  ■'*'> 


where 


Thus,  by  Lemma  4.3  and  condition  (B2), 


Yj-'.jei"'  >  (£)^EE /V-‘(^)).«(^-‘(^)u»'"uv(3)ds 

u=l  <;  =  1  •'‘o 

=  [\v-^{s}Ud3. 

Jto 


Next,  since 


the  Lindeberg  condition 


sup  |/r‘'‘’(3)|  <  -^, 

1=  1,...  ,n  V  ^ 


r  E^‘;’(^)"A,(3)/(|/7.';'(3)|  >  e)d^O 

Jin 


'  *«■  •*  «•'  V  V  ’•*  ■‘JV’**"  'i/  'V  *«*•  "V*  "i^  *w^  "j*  'J‘ 
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as  n  — »  oo  for  each  e  >  0,  is  satisfied.  Thus,  by  ReboUedo’s  central  limit  theorem 
for  local  square  integrable  martingales,  in  Z?[0,  Ip,  and  by  (4.9),  X*"* 

has  the  same  limit  distribution.  This  completes  the  proof. 

Proof  of  Theorem  3.3.  Define  the  smoothed  version  of  a: 


On  Jto  \  On  / 


Since  ai, . . .,  Op  are  assumed  to  have  bounded  derivatives  in  a  neighbourhood  of 
t, 

{nln)^\Ht)-a-{t)\\  =  0{nbl)^  -^0. 

Using  (2.2),  (3.6)  and  (3.7)  and  Lemma  4.3(a)  we  can  write 

(ni„)i(a(t)-a*(t))  =  A'(">+op(l), 
where  X*"*  is  the  p-dimensional  random  vector  with  jth  component 

=  /‘/ri;'(s)dAf.(s). 

=  ^  G<;'(s), 

Vnbn 

(;(;)(s)  =  i:(t^-M^)U>;A(^)V«'.(3). 

^  '  fc=i 

As  in  the  proof  of  Theorem  3.2  we  truncate  Cj|y '  in  order  to  apply  Rebolledo’s 
central  limit  theorem.  This  gives  new  processes  Cjy  *  and  satisfying  (4.5)- 
(4.7).  The  Lindeberg  condition  (4.12)  is  satisfied  since 


and  nbn  — *  cxj.  Since  ai,...,ap  are  bounded  in  a  neighbourhood  It  of  £,  by 
Lemma  4.3, 

9up|V’u«(3)  -  V„„(3)|-^0,  (4.13) 

*€/t 


fWW 


•*.  ^  •*  ^  IT-  nr.  •  -  r.  •?  .  r  .  .  .  - 
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where  is  defined  by  (4.11).  Now  consider 

■'*»  1=1 

“n  „  =  !  •'to  V  On  / 

p  p 

-EE 

U=1 vsl 

+  Op(l) 

4  p^k^x)dx{v-^{t)u 

by  (4.13)  and  Lemma  4.3(c).  Application  of  Rebolledo’s  central  limit  theorem 

(see  Liptser  and  Shiryayev  (1980,  Remark  1))  to  Jf*"*  gives  the  result. 
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